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Abstract — In "On Coding for Reliable Communication over 
Packet Networks" (Lun, Medard, and Effros, Proc. 42nd Annu. 
Allerton Conf. Communication, Control, and Computing, 2004), a 
capacity-achieving coding scheme for unicast or multicast over 
lossy wireline or wireless packet networks is presented. We extend 
that paper's results in two ways: First, we extend the network 
model to allow packets received on a link to arrive according to 
any process with an average rate, as opposed to the assumption of 
Poisson traffic with i.i.d. losses that was previously made. Second, 
in the case of Poisson traffic with i.i.d. losses, we derive error 
exponents that quantify the rate at which the probability of error 
decays with coding delay. 

I. Introduction 

This paper extends [1], which describes a coding scheme for 
reliable communication over packet networks. The most no- 
table features of this scheme are that it is capacity-achieving; 
that intermediate nodes perform additional coding yet do not 
decode nor even wait for a block of packets before sending 
out coded packets; that little explicit network management is 
required, with little or no requirement for feedback or for co- 
ordination; and that all coding and decoding operations have 
polynomial complexity. The scheme therefore differs from 
schemes based on the digital fountain approach (e.g., [2], [3], 
[4]), where intermediate nodes perform no additional coding 
and lower rates are achieved. It also differs from schemes that 
operate in a block-by-block manner (e.g., [5]), where larger 
end-to-end delays are expected. The main result of [1] is that 
the scheme is indeed capacity-achieving under the assumption 
of Poisson traffic with i.i.d. losses. 

It turns out, however, that the scheme is in fact capacity- 
achieving under much more general conditions, and one of 
our extensions is to show that it can achieve capacity when 
the arrival of packets received on a link is described by any 
arrival process with an average rate. No assumptions are made 
on loss correlation or lack thereof — all we require is that a 
long-run average rate of arrivals exists. This fact is particularly 
important in wireless packet networks, where slow fading and 
collisions often cause losses to be correlated in time and 
across separate links. Our other main extension is to derive 
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error exponents in the case of Poisson traffic with i.i.d. losses. 
Having error exponents is important as it allows us to quantify 
the rate of decay of the probability of error with coding delay 
and to determine the parameters of importance in this decay. 

Our work draws upon results from the usually-disparate 
fields of information theory and queueing theory and, to a 
degree, forms a bridge between the two. In particular, we can 
view traditional error exponents as a limiting case in which 
packets arrive in regular, deterministic intervals, and traditional 
queueing as a special case in which no coding is performed. 
Our approach relaxes the constraints in these two limiting 
cases and can be looked upon as a generalization to both. 

II. Network model 

We first describe a model for wireline packet networks. We 
model a wireline packet network as a directed graph Q — 
(TV, A), where Af is the set of nodes and A is the set of arcs. 
Each arc represents a lossy point-to-point link. Some 

subset of the packets injected into arc by node i are 

lost; the rest are received by node j without error The process 
by which packets are injected into an arc is taken as given, 
and it comes about, for example, by using some appropriate 
traffic pattern (e.g., deterministic or Poisson) at a rate that 
is determined by optimizing according to some cost criterion 
over all rate allocations that allow for the desired connection 
to be established (see [6]). 

We denote by Zij the average rate at which packets are 
received on arc (i, j). More precisely, suppose that the arrival 
of received packets on arc (i, j) is described by the counting 
process Aij, i.e. for r > 0, Aij{T) is the total number of 
packets received between time and time r on arc 
Then limi-^oo Aij{T)/T — zij a.s. We define a lossy wireline 
packet network as a pair (C/, z). 

We assume that links are delay-free in the sense that the 
arrival time of a received packet corresponds to the time that 
it was injected into the link. We make this assumption for the 
purpose of simplicity, and it can be relaxed. 

For wireless packet networks, we model the network as a 
directed hypergraph Ti — (TV, A), where TV is the set of nodes 
and A is the set of hyperarcs. A hypergraph is a generalization 
of a graph where generalized arcs, called hyperarcs, connect 
two or more nodes. Thus, a hyperarc is a pair (i, J), where 



i, the head, is an element of Af, and J, the tail, is a non- 
empty subset of Af. Each hyperarc {i, J) represents a lossy 
broadcast link. For each K c J, some disjoint subset of the 
packets injected into hyperarc {i, J) by node i are received by 
all nodes in K without error. 

We denote by Zij^ the average rate at which packets, 
injected on hyperarc {i, J), are received by the set of nodes 
K C J. More precisely, suppose that the arrival of packets 
that are injected on hyperarc {i, J) and received by the set of 
nodes if C J is described by the counting process AijK- Then 
\m\r~,oo AijK{T)/T = ZijK a.s. We define a lossy wireless 
packet network as a pair (H, z). 

III. Coding scheme 

We suppose that, at the source node s, we have K message 
packets wi, m;2, • ■ • , wk, which are vectors of length p over 
the finite field F,. (If the packet length is h bits, then we take 
p = \b/ log2 q\ .) The message packets are initially present in 
the memory of node s. 

The coding operation performed by each node is simple to 
describe and is the same for every node: Received packets 
are stored into the node's memory, and packets are formed 
for injection with random linear combinations of its memory 
contents whenever a packet injection occurs on an outgoing 
hyperarc. The coefficients of the combination are drawn uni- 
formly from Vq. 

Since all coding is linear, we can write any packet x in the 
network as a linear combination of wi, W2, ■ • ■ , wk, namely, 
^ — Y^k^i IkWk- We call 7 the global encoding vector of 
X, and we assume that it is sent along with x, in its header. 
The overhead this incurs (namely, K loga q bits) is neghgible 
if packets are sufficiently large. 

A sink node collects packets and, if it has K packets 
with hnearly-independent global encoding vectors, it is able 
to recover the message packets. Decoding can be done by 
Gaussian elimination. In addition, the scheme can be operated 
ratelessly, i.e. it can be run indefinitely until aU sink nodes in 
T can decode (at which stage that fact is signaled to all nodes, 
requiring only a small amount of feedback). 

IV. Coding theorems 

In this section, we specify achievable rate intervals for the 
coding scheme in various scenarios. The fact that the intervals 
we specify are the largest possible (i.e. that the scheme is 
capacity-achieving) follows from the cut-set bound for multi- 
terminal networks (see [7, Section 14.10]). 

A. Wireline networks 

1 ) Unicast connections: Suppose that we wish to establish 
a connection of rate arbitrarily close to R packets per unit 
time from source node s to sink node t. Suppose further that 



where Q(s, t) is the set of all cuts between s and t, and Tj^{Q) 
denotes the set of forward arcs of the cut Q, i.e. 

r+(g) ■.= {{i,j)&A\ieQ,j^Q}. 

Therefore, by the max-flow/min-cut theorem (see, for example, 
[8, Section 3.1]), there exists a flow vector / satisfying 
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if i = s, 
if i = t, 
otherwise, 
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for all i G Af, and < fij < Zij for all G A. We assume, 
without loss of generality, that / is cycle-free in the sense that 
the subgraph Q' = {J\f,A'), where A' := € A\fij > 

0}, is acyclic. (If Q' has a cycle, then it can be eliminated by 
subtracting flow from / around it.) 

Using the conformal realization theorem (see, for example, 
[8, Section 1.1]), we decompose / into a finite set of paths 
{pi,P2, ■ ■ ■ ,pm}, each carrying positive flow Rm for m = 
1,2,..., M, such fliat J2Z=i Rrn = R. 

Our proof is based on tracking the propagation of what 
we call globally innovative packets. We first consider packets 
received by outward neighbors of s. Suppose packet x is 
received by node j, an outward neighbor of s, at time r. 
We associate with x the independent random variable P^, 
which, for Pm containing arc (s, j), takes the value pm with 
probability (1 — l/q)Rm/ Zgj. We then say that x is globally 
innovative at node j on path pm if Px = Pm- Suppose that 
the coding scheme is run for a total time A, from time till 
time A, and that, in this time, a total of A'' globally irmovative 
packets are received by outward neighbors of s. We call these 
packets vi,V2, ■ ■ • , vn- 

In general, any received packet ?/ in the network is a Unear 
combination of packet received by outward neighbors of s, 
so we can write y = Yln=i l^nVn + E^=jv+i ^n-NVn, where 
vn+i,vn+2, ■ ■ ■ ,vn' denote the packets received by outward 
neighbors of s that are not globally innovative. Since u„ is 
formed by a random linear combination of w\,W2, ■ ■ . , wk, 
we have w„ = J2k=i '^nkWk for n = 1, 2, . . . , A''. Hence, the 
fcth component of the global encoding vector of y is given by 

7fc = En=i l^na-nk + En=jv+i ^n-NC^nk- We caU the vector 
/? associated with y the auxiliary encoding vector of y, and 
we see that any sink that receives K or more packets with 
linear-independent auxiliary encoding vectors has K packets 
whose global encoding vectors collectively form a random 
K X K matrix over Fg, with all entries chosen uniformly. 
If this matrix is invertible, then the sink is able to recover 
the message packets. The probability that a random K x K 
matrix is invertible is nALi(-'^ ^ l/'z'^)> which can be made 
arbitrarily close to 1 by taking q arbitrarily large. Therefore, 
to determine whether a sink can recover the message packets, 
we essentially need only to determine whether it receives K 
or more packets with hnearly-independent auxiliary encoding 
vectors. 

We have so far only defined the term globally innovative for 
packets received by outward neighbors of s. Before defining 



the term for packets received by the remaining nodes, we 
associate with each path pm and each node i on p.,„ the set of 

(v ) 

vectors , which varies with time and which is initially 

empty, i.e. v}^'"\o) :— 0. Whenever a packet x received 
at time t is declared globally innovative at node i on path 
Pm, its auxiliary encoding vector /3 is added to v}'^"^\ i.e. 
v}^"'\t+) := i3UV^^P"'\t). Let C/^P") := f/^"', where j is 
the outward neighbor of s on p,,,,, and let W^^"^^ :— V'/^'"''. 
Now consider node i s and suppose packet x, with auxiliary 
encoding vector (3, is received by node j, an outward neighbor 
of i, at time r. We again associate with x the independent 
random variable Px, but Pr{Px = Pm) = Rm/zij for pm 
containing arc and we say that x is globally innovative 

at node j on path ifP^; = /3 ^ span(Uj^7^M^(P') (A)U 
v}''-\T)UUfl„,^,UM{A)), and \V^'-\t)\ > \vl''-\r)\. 

The definition of globally innovative is devised to sat- 
isfy two conditions: first, that U^^^^''^'"'' (^) linearly 
independent; and, second, that the propagation of globally 
innovative packets through the network is described by a 
queueing network. That the first of these two conditions is 
satisfied can be verified easily: Vectors are added to W'-p^^t) 
only if they are linearly independent of existing ones; vectors 
are added to W^'p^^{t) only if they are linearly independent of 
existing ones and ones in T^(Pi)(A); and so on. The second 
of the two conditions requires more investigation. 

Consider path pm and node i on pm- Let j be the outward 
neighbor of i on pm- Suppose that packet x is received by 
node j from node i at time r and that there are more globally 
innovative packets at i than at j, i.e. \v}^"^\t)\ > |Vj'-^'"-'(t)|. 
Then, because Uj"7^ VF^P' ) (A) U f/^" V) U ufi^^^U^P'\A) 
and W^-^W^P'^A) U f/^-V) U Uf£„+iC/(w)(A) are both 
linearly independent, spSin{U]l^^^W'^P'^A) U v}''"'\t) U 
^fim+iU^'^'H^)) t spaii(U™7iTy(P'HA) U T//P'"^(t) U 
Ufi™+it/'P'HA)), which implies that span(F/^'" V)) t 
span{U™7iM/(P')(A) U f/^'"^(t) U ufi„+i[/(P') (A)). Now, 
the packet a; is a random linear combination of vectors from 
a set that contains V^^^'"''(t), so x is globally innovative with 
some non-trivial probability. This probability can be bounded 
using the following lemma from [9]. We quote the lemma 
without repeating the proof. 

Lemma 1: [9, Lemma 2.1] Let Vi and V2 be two collections 
of vectors from F^, and let /3 be a random linear combination 
of the vectors in Vi, with the coefficients of the combination 
drawn uniformly from Fg. Then 

Pr(/3 ^ span(T/2) | span(Fi) span(F2)) > 1 - -• 

q 

It follows from Lemma ^ that x is globally innovative with 
probability not less than (1 — l/q)Rjn/ Zij. Since we can al- 
ways discard globally innovative packets, we assume that x is 
globally innovative with probability exactly (1 — l/q)Rm/ Zij- 
If instead = |Vj'^'"''(r)|, we see that x cannot be 

globally innovative, and this remains true until another arrival 



occurs at i. Therefore, the propagation of innovative packets 
through node i on path pm can be described as the propagation 
of jobs through a single-server queueing station, where the 
state |V^/'''"''(t)| > |V^''^'"''(t)| corresponds to a non-empty 
queue and the state |V^/^'"''(t)| = | corresponds to 

an empty queue. 

The queueing station is serviced with probabiUty (1 — 
^/q)Rm/ Zij whenever the queue is non-empty and a received 
packet arrives on arc We can equivalently consider "can- 
didate" packets that arrive with probability (1 — l/q)Rml Zij 
whenever a received packet arrives on arc (i, j) and say that 
the queueing station is serviced whenever the queue is non- 
empty and a candidate packet arrives on arc (i, j). Therefore, 
the queueing network that we wish to analyze is one with 
N jobs initially present at node s and with M paths where, 
on each path, jobs are serviced on each arc by the arrival of 
candidate packets. 

We analyze the queueing network of interest using the fluid 
approximation for discrete-flow networks (see, for example, 
[10], [11]). We begin by considering a single path pm- We 
write pra = {ii,i2, ■ • • , jL„,t}, where ii = s. Let Bmi 
and Cmi be the counting processes for the arrival of globally 
innovative packets and candidate packets, respectively, on arc 
k+i) for path Let Q^/"'' (t) be the number of packets 
queued for service at ii at time r when there are iV,„ jobs 
initially present at node s. Hence, for I = 1,2,...,L„, 
Q^r^ = Bm(i^i) ~Bmi, where Bmo{r) := Nm for all r > 0. 
Let C,no{T) ■■= Nm for afl r > 0, Xmi := Cm(i-i) - CmU 
and Ymi := C„/ - Bmi- Then, for ? = 1, 2, . . . , 

Q^^r'^ =Xml-Ymil-l)+Yml. (1) 

Moreover, we have 

Q''mr\r)dYmi{T)^Q, (2) 
dYml{T) > 0, (3) 

and 

Q';nr\r)>0 (4) 
for all r > and 1 = 1,2,..., L„i, and 

Yrnlir) = (5) 

for alH = 1, 2, . . . , Lm- 

We observe now that equations Q-Q give us the condi- 
tions for a Skorohod problem (see, for example, [10, Section 
7.2]) and, by the oblique reflection mapping theorem, there 
is a well-defined, Lipschitz-continuous mapping $ such that 
gf^") = ^Xm). 

Let Cl^r\^) ■■= Cmi{Nmr)/Nm, Xl^r\^) ■= 
Xmi{NmT)/Nm, and Q|^"V) := Q''mr\N^nT)/Nm. Re- 
call that Aij is the counting process for the arrival of received 
packets on arc Therefore, C'^'"''(t) is the sum of 



^iiH+ii''') Bernoulli-distributed random variables with param- 
eter (1 - l/q)Rrn/zi^i^^^. Hence 



Cmiir) := lim 



+ 



1 (1 - llq)Rrr. , . 

= {l-l/q)RmT a.s., 

where the last equality follows by the assumptions of the 
model. Therefore, 

X^i{t):= lira X^-Hr) 

Nm—>-00 

{l-{l-l/q)R„,T if/ = l, 







Otherwise 



By the Lipschitz-continuity of $, then, it follows that Qm '■— 

VanN^^ooQm"^^ = ^{Xm), i.e. Qm is, almost surely, the 
unique Qm that satisfies, for some Ym, 

r, .^ /l-(l-l/9)i?™T + Y™i(T) if/-l, 
I - i;„(,_i)(r) otherwise, 



dYmAT) > 0, 



and 



Qmiir) > 
for all T > and I ~ 1,2, ... , Lm, and 

for alH = 1, 2, . . . , L^. 

A pair (Qm,^™) that satisfies (l6l- J10> is 

'(l-(l-l/g)i?,„T)+ if/ = l, 



(r) 







otherwise, 



(7) 
(8) 

(9) 
(10) 



(11) 



and 



Y„At) = (1 - (1 - l/q)RmTy . 

Hence Qm is given by equation il 1> . 

Recall that the sink can recover the message packets 
with high probability if it receives K or more packets 
with linearly-independent auxiliary encoding vectors and that 
Um=iW^P"^\A) is linearly-independent. Therefore, the sink 
can recover the message packets with high probability if it 
has K or more globally innovative packets at t on any path 
Pm- Let 1/ be the number of globally innovative packets at 
t at time A. Then > N ~ J^^^i E/t"! where 
Nm = ani(A)- Take A = iV/(l - l/q)R. Then 

M L 



lim — > lim 1 — — —Q^, 



rn=l 1=1 



Nm{l - l/q)R 



1 



a.s., 



since limyv^oo Nm/N = Rm/R a.s. 

Take K = \NRc], where < Rc < 1. Then, for N 
sufficiently large, v > K with probability arbitrarily close 



to 1 . So the probability of error can be made arbitrarily small, 
and the rate achieved is 



- = -(1 
A 



l/q)R > Rail - l/q)R, 



which can be made arbitrarily close to R. This concludes the 
proof for unicast connections. 

2) Multicast connections: The proof for multicast connec- 
tions is very similar to that for unicast connections. In this 
case, rather than a single sink t, we have a set of sinks T. 
And we suppose 



R < min min 

teT gea(s,t) 



Therefore, by the max-flow/min-cut theorem, there exists, for 
all t e T, a flow vector satisfying 



E 



At) _ 

J ij 



bl(«j)e-4'} 
for all i e TV, and 



{J\U,^)eA'} 



At) ^ 

J ii 



R if i = s, 
-R if i = t, 
otherwise, 



^ --ij for all e A'. 
For each flow vector f^*\ we go through the same argu- 
ment as that for a unicast connection, and we find that the 
probability of error at every sink node can be made arbitrarily 
small by taking N and q sufficiently large. 

We summarize our results regarding wireline networks with 
the following theorem statement. 

Theorem 1: Consider the lossy wireline packet network 
{Q, z). The random linear coding scheme we describe is 
capacity-achieving for multicast connections, i.e. it can 
achieve, with arbitrarily small error probability, a multicast 
connection over the network from source node s to sink nodes 
in the set T at rate arbitrarily close to R packets per unit time 
if 



R < min min 

t£T QeQ{s,t) 



E • 

(,:j)er+(Q) 



B. Wireless packet networks 

The wireless case is actually very similar to the wireline 
one. The main difference is that the same packet may be 
received by more than one node, so we must specify which 
node considers the packet to be globally innovative. With an 
appropriate redefinition of globally innovative, we obtain the 
following theorem. We omit the details of the development for 
the sake of brevity. 

Theorem 2: Consider the lossy wireless packet network 
(TCjz). The random linear coding scheme we describe 
is capacity-achieving for multicast connections, i.e. it can 
achieve, with arbitrarily small error probability, a multicast 
connection over the network from source node s to sink nodes 
in the set T at rate arbitrarily close to R packets per unit time 
if 



R < min min 

*6T QeQ{s,t) 



E E 

(t,J)£r+{Q)K(^Q 



ZiJK 



V. Error exponents for Poisson traffic with i.i.d. 

LOSSES 

We now look at the rate of decay of the probabiHty of 
error pe in the coding delay A. In contrast to traditional error 
exponents where coding delay is measured in symbols, we 
measure coding delay in time units — time t = A is the time 
at which the sink nodes attempt to decode the message packets. 
The two methods of measuring delay are essentially equivalent 
when packets arrive in regular, deterministic intervals. 

We specialize to the case of Poisson traffic with i.i.d. losses. 
Hence, in the wireline case, the process Aij is a Poisson 
process with rate and, in the wireless case, the process 
AijK is a Poisson process with rate Zijx- 

Consider the unicast case for now, and suppose we wish to 
establish a connection of rate R. Let C be the supremum of all 
asymptotically-achievable rates. We consider the limit where 
q, the coding field size, is infinitely large, thus implying large 
coding complexity and packet size. 

We begin by deriving an upper bound on the probability 
of error. To this end, we take a flow vector / from s to t of 
size C and develop a queueing network from it that describes 
the propagation of globally innovative packets. This queueing 
network now becomes a Jackson network. Suppose that N, the 
number of jobs initially present at s, is infinitely large. Then, 
as a consequence of Burke's theorem (see, for example, [12, 
Section 2.1]) and the fact that the queueing network is acyclic, 
the arrival and departure processes at all nodes are Poisson in 
steady-state. 

Let 5't(m) be the arrival time of the mth globally innovative 
packet at t. Then, we have, for 6 < C, 



1 



C 



lim — logE[exp(6'5't(m))] = log ■ 

m-*oo m o — I 



(12) 



since, in steady-state, the arrival of globally innovative packets 
at t is described by a Poisson process of rate C. If an error 
occurs, then fewer than [i?A] globally innovative packets are 
received by t by time r = A, which is equivalent to saying 
that ^'t([i?A]) > A. Therefore, 

Pe<Pr{^'ti\RA])> A), 

and, using the Chernoff bound, we obtain 

Pe< mill cxp(-6'A + logE[cxp(6'*t(ri?Al))l). 

o<e<c 

Let e be a positive real number Then using equation (I12> we 
obtain, for A sufficiently large, 

C 



Pe < min exp -6 A + RA i log ■ 

o<e<c \ { C - 6 

= exp(-A(C - i? - Rlog{C/R)) + RAe). 
Hence, we conclude that 

logPe 



lim ■ 

A— »oo 



A 



> C - R - R\og{C/R). 



(13) 



For the lower bound, we examine a cut whose flow capacity 
is C. We take one such cut and denote it by Q* . It is clear 
that, if fewer than [i?A] distinct packets are received across 



Q* in time t = A, then an error occurs. For both wireline and 
wireless networks, the arrival of distinct packets across Q* is 
described by a Poisson process of rate C. Thus we have 



ri?Ai-i 



Pe > exp(-CA) ^ 



1=0 



{CAY 
II 



> exp(-CA) 



(CA) 



\RA]-1 



ri\RA]) ' 

and, using Stirling's formula, we obtain the reverse inequality 
to il3i . Therefore, 

logPe 



lim ■ 

A^oo 



= C - R- R\og{C/R). 



(14) 



Equation defines the asymptotic rate of decay of the 
probability of error in the coding delay A. This asymptotic 
rate of decay is determined entirely by R and C. Thus, for 
a packet network with Poisson traffic employing the coding 
scheme described in Section|n]with large field size q, the flow 
capacity C of the minimum cut of the network is essentially 
the sole figure of merit of importance in determining the 
effectiveness of the coding scheme for large, but finite, coding 
delay. Thus, in deciding how to inject packets to support the 
desired connection, we need only consider this figure of merit. 

Extending the result from unicast connections to multicast 
connections is quite straightforward. We again obtain equation 
(I14> in the multicast case. 
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